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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Met Leu Leu Asn Val Leu Arg lie Cys lie lie Val Cys Leu Val Asn 
15 10 15 

Asp Gly Ala Gly Lys His Ser Glu Gly Arg Glu Arg Thr Lys Thr Tyr 
20 25 30 

Ser Leu Asn Ser Arg Gly Tyr Phe Arg Lys Glu Arg Gly Ala Arg Arg 
35 40 45 

Ser Lys lie Leu Leu Val Asn Thr Lys Gly Leu Asp Glu Pro His lie 
50 55 60 

Gly His Gly Asp Phe Gly Leu Val Ala Glu Leu Phe Asp Ser Thr Arg 
65 70 75 80 

Thr His Thr Asn Arg Lys Glu Pro Asp Met Asn Lys Val Lys Leu Phe 

85 90 95 

Ser Thr Val Ala His Gly Asn Lys Ser Ala Arg Arg Lys Ala Tyr Asn 
100 105 110 

Gly Ser Arg Arg Asn lie Phe Ser Arg Arg Ser Phe Asp Lys Arg Asn 
115 120 125 

Thr Glu Val Thr Glu Lys Pro Gly Ala Lys Met Phe Trp Asn Asn Phe 
130 135 140 

Leu Val Lys Met Asn Gly Ala Pro Gin Asn Thr Ser His Gly Ser Lys 
145 150 155 160 

Ala Gin Glu lie Met Lys Glu Ala Cys Lys Thr Leu Pro Phe Thr Gin 

165 170 175 

Asn lie Val His Glu Asn Cys Asp Arg Met Val lie Gin Asn Asn Leu 
180 185 190 

Cys Phe Gly Lys Cys lie Ser Leu His Val Pro Asn Gin Gin Asp Arg 
195 200 205 

Arg Asn Thr Cys Ser His Cys Leu Pro Ser Lys Phe Thr Leu Asn His 
210 215 220 
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Leu Thr Leu Asn Cys Thr Gly Ser 
225 230 

Met Val Glu Glu Cys Thr Cys Glu 

245 

Thr Ala Gin Phe Asn Met Asp Thr 
260 




Lys Asn Val val Lys Val Val Met 
235 240 

Ala His Lys Ser Asn Phe His Gin 
250 255 

Ser Thr Thr Leu His His 

265 270 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDRESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CPNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GAATTCCTAA AAGCGGCACA GTGCAGGAAC AGCAAGTCGC TCAGAAACAC TGCAGGGTCT 60 

AGATATCATA CAATGTTACT AAATGTACTC AGGATCTGTA TTATCGTCTG CCTTGTGAAT 120 

GATGGAGCAG GAAAACACTC AGAAGGACGA GAAAGGACAA AAACATATTC ACTTAACAGC 180 

AGAGGTTACT TCAGAAAAGA AAGAGGAGCA CGTAGGAGCA AGATTCTGCT GGTGAATACT 240 

AAAGGTCTTG ATGAACCCCA CATTGGGCAT GGTGATTTTG GCTTAGTAGC TGAACTATTT 300 

GATTCCACCA GAACACATAC AAACAGAAAA GAGCCAGACA TGAACAAAGT CAAGCTTTTC 360 

TCAACAGTTG CCCATGGAAA CAAAAGTGCA AGAAGAAAAG CTTACAATGG TTCTAGAAGG 420 

AATATTTTTT CTCGCCGTTC TTTTGATAAA AGAAATACAG AGGTTACTGA AAAGCCTGGT 480 

GCCAAGATGT TCTGGAACAA TTTTTTGGTT AAAATGAATG GAGCCCCACA GAATACAAGC 540 

CATGGCAGTA AAGCACAGGA AATAATGAAA GAAGCTTGCA AAACCTTGCC CTTCACTCAG 600 

AATATTGTAC ATGAAAACTG TGACAGGATG GTGATACAGA ACAATCTGTG CTTTGGTAAA 660 

TGCATCTCTC TCCATGTTCC AAATCAGCAA GATCGACGAA ATACTTGTTC CCATTGCTTG 720 

CCGTCCAAAT TTACCCTGAA CCACCTGACG CTGAATTGTA CTGGATCTAA GAATGTAGTA 780 

AAGGTTGTCA TGATGGTAGA GGAATGCACG TGTGAAGCTC ATAAGAGCAA CTTCCACCAA 840 
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ACTGCACAGT TTAACATGGA TACATCTACT ACCCTGCACC ATTAAAAGGA CTGTCTGCCA 900 

TACAGTATGG AAATGCCCAT TTGTTGGAAT ATTCGTTACA TGCTATGTAT CTAAAGCATT 960 

ATGTTGCCTT CTGTTTCATA TAACCACATG GAATAAGGAT TGTATGAATT ATAATTAACA 1020 

AATGGCATTT TGTGTAACAT GCAAGATCTC TGTTCCATCA GTTGCAAGAT AAAAGGCAAT 1080 

ATTTGTTTGA CTTTTTTCTA CAAAATGAAT ACCCAAATAT ATGATAAGAT AATGGGGTCA 114 0 

AAACTGTTAA GGGGTAATGT AATAATAGGG ACTAACAACC AATCAGCAGG TATGATTTAC 1200 

TGGTCACCTG TTTAAAAGCA AACATCTTAT TGGTTGCTAT GGGTTACTGC TTCTGGGCAA 1260 

AATGTGTGCC TCATAGGGGG GTTAGTGTGT TGTGTACTGA ATTAATTGTA TTTATTTCAT 1320 

TGTTACAATG AAGAGGATGT CTATGTTTAT TTCACTTTTA TTAATGTACA ATAAATGTTC 1380 

TTGTTTCTTT AAAAAAAAAA AAAAACTCGA G 1411 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Ser Arg Thr Arg Lys val Asp Ser Leu Leu Leu Leu Ala lie Pro 
15 10 15 

Gly Leu Ala Leu Leu Leu Leu Pro Asn Ala Tyr Cys Ala Ser Cys Glu 
20 25 30 

Pro Val Arg lie Pro Met Cys Lys Ser Met Pro Trp Asn Met Thr Lys 
35 40 45 

Met Pro Asn His Leu His His Ser Thr Gin Ala Asn Ala lie Leu Ala 
50 55 60 

lie Glu Gin Phe Glu Gly Leu Leu Thr Thr Glu Cys Ser Gin Asp Leu 
65 70 75 80 

Leu Phe Phe Leu Cys Ala Met Tyr Ala Pro lie Cys Thr lie Asp Phe 

85 90 95 
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Gin Hie 
Ala Gly 



Ser Leu 
130 

Ser Pro 
145 



Glu Pro 
100 

Cys Glu 
115 

Ala Cys 
Glu Ala 



Asp Phe Ser Met 
His Cys 
Asn Asn 



Lys Cys 
180 



Lys Cys 
210 

Ser Ser 
225 

Ser Gly 
Met Gly 
Ser Leu 



Trp Asp 
290 

Pro lie 
305 



Tyr Asn 
195 

His Asp 

Leu Val 

Cys Leu 

Tyr Glu 
260 

Ala Glu 
275 

Gin Lys 
Pro Asn 



lie Lys 

Pro lie 

Glu Glu 

He val 
150 

Asp Ser 
165 

Lys Pro 

Tyr Val 

Ala Thr 

Asn He 
230 

Cys Pro 
245 

Asp Lys 
Lys Trp 
Leu Arg 



Lys Asn 
310 



Pro Cys Lys 
105 

Leu He Lys 
120 

Leu Pro Val 
135 

Thr Val Glu 

Asn Asn Gly 

Met Lys Ala 
185 

He Arg Ala 
200 

Ala He Val 
215 

Pro Lys Asp 
Gin Leu Val 



Glu Arg Thr 
265 

Arg Asp Arg 
280 

Arg Pro Arg 
295 

Ser Asn Ser 



Ser Val Cys 



Tyr Arg His 



Glu Arg Ala Arg 
110 

Thr Trp Pro Glu 
125 



Tyr Asp Arg Gly Val Cys He 
140 



Gin Gly Thr 
155 

Asn Cys Gly 
170 

Thr Gin Lys 

Lys Val Lys 

Glu Val Lys 
220 

Thr Val Thr 
235 

Ala Asn Glu 
250 

Arg Leu Leu 
Leu Ala Lys 



Lys Ser Lys 
300 

Arg Gin Ala 
315 



Asp Ser Met Pro 
160 

Ser Gly Arg Glu 
175 

Thr Tyr Leu Lys 
190 

Glu Val Lys Val 
205 

Glu He Leu Lys 



Leu Tyr Thr Asn 
240 

Glu Tyr He He 
255 

Leu Val Glu Gly 
270 

Lys Val Lys Arg 
285 

Asp Pro Val Ala 



Arg Ser 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1875 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



[Page 5] 




(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: 


SEO ID NO: 4 


• 






GAATTCCCTT 


TCACACAGGA 


CTCCTGGCAG 


AGGTGAATGG 

nvjvj x \jrUi x wvj 


TTAGPCPTAT 

x invjwu \« x ** x 


GGATTTGGTT 

vjvjnx x x \3w x x 




TGTTGATTTT 


GACACATGAT 


TGATTGCTTT 


PAGATAGGAT 


TGAAGGAPTT 


GG ATTTTTAT 

VJvJri X XXXX «* X 


X £ u 


CTAATTCTGC 


ACTTTTAAAT 


TATCTGAGTA 


ATTGTTCATT 

• » X X VJ X X Vj*f* X X 


TTGTATTGGA 

X X \j X *» X X VJ VJ** 


TGGGACTAAA 

a v« a * ** ** * 


180 


GATAAACTTA 


ACTCCTTGCT 


TTTGACTTGC 


CCATAAACTA 

V^ w ** X ****** V* X «* 


TAAGGTGGGG 

X **** VJ VJ X V? VJ VJ VJ 


TGAGTTGTAG 

A ^J«*\J A A %^ J> CANJ 


240 


TTGCTTTTAC 


ATGTGCCCAG 


ATTTTCCCTG 


TATTCCCTGT 

X ** X X W W v* A VJ A 


ATTCCCTCTA 

#* X X \ar W w X V* X ** 


AAGTAAGCCT 

**** NJ X **** \J W X 


300 

*J V V 


ACACATACAG 


GTTGGGCAGA 


ATAACAATGT 


CTCGAACAAG 

V* A V^ VJ V^ A*** VJ 


GAAAGTGGAC 

Vja^JTU* VJ X VJ VJ ** w 


TCATTACTGC 

X V« ** X X ** >^ X 


360 

J V V 


TACTGGCCAT 


ACCTGGACTG 


GCGCTTCTCT 


TATTACCCAA 

X ** X X ** w V* V* ***» 


TGCTTACTGT 

X VJV* X X «*V«» X VJ X 


GCTTCGTGTG 

\J W X X W NJ X VJ X W 


420 

"I X> V 


AGCCTGTGCG 


GATCCCCATG 


TGCAAATCTA 


TGCCATGGAA 

X VJ w V* *» X vj vj**x* 


CATGACCAAG 

v^ «* x vj ** v^ v* ru*\j 


ATGCCCAACC 


480 

T O V 


ATCTCCACCA 


CAGCACTCAA 


GCCAATGCCA 


TCCTGGCAAT 

*v V^ %■# A VJ VJ V* <*** A 


TGAACAGTTT 

A VJ fUA V^ ** VJ A J* A. 


GAAGGTTTGC 

VJ •*** \J V3 XXX VJ V>^ 


540 


TGACCACTGA 


ATGTAGCCAG 


GACCTTTTGT 


TCTTTCTGTG 
a v* a ^ *> v^ ^ vj ^ vj 


TGCCATGTAT 

A VJ V* A VJ A *» J* 


GCCCCCATTT 

\J V* V* w W W<* X X X 


600 

V v V 


GTACCATCGA 


TTTCCAGCAT 


GAACCAATTA 


AGCCTTGCAA 


GTCCGTGTGC 

VJ J> V* V* VJ A VJ A VJ Vrf 


GAAAGGGCCA 

VJ A ** ** * VJ VJ VJ V** V^ a* 


660 

W V V 


GGGCCGGCTG 


TGAGCCCATT 


CTCATAAAGT 


ACCGGCACAC 


TTGGCCAGAG 


AGCCTGGCAT 

* * ^J V^ V* *> XJ VJ V^ * * X 


720 


GTGAAGAGCT 


GCCCGTATAT 


GACAGAGGAG 


TCTGCATCTC 

A Vrf- A VJ V** *A A *> V«« 


CCCAGAGGCT 

V^ V*» V^ ** VJ ** VJ VJ Vp^ X 


ATCGTCACAG 

** X V» VJ x v*nw**vj 


780 


TGGAACAAGG 


AACAGATTCA 


ATGCCAGACT 


TCTCCATGGA 
\* a w v^*r* a vj vj** 


TTCAAACAAT 

X A. V* ****** V* A 


GGAAATTGCG 

WWiUul* X VJ W VJ 


840 


GAAGCGGCAG 


GGAGCACTGT 


AAATGCAAGC 


CCATGAAGGC 

w V^ ** X wfuly vw 


AACCCAAAAG 

«U*W W WtVUVlw 


ACGTATCTCA 

** V* VJ X ** X Nmt X \*t% 


900 


AGAATAATTA 


CAATTATGTA 


ATCAGAGCAA 


AAGTGAAAGA 

*TU*VJ X VJ *»-»*** VJ ** 


GGTGAAAGTG 

X VJ«»X*i* w X VJ 


AAATGPPArG 

«**Vfi X yWWnwVJ 


960 

^ u u 


ACGCAACAGC 


AATTGTGGAA 


GTAAAGGAGA 


TTCTCAAGTC 

A A V«> A. V^**JT*VJ X w 


TTCCCTAGTG 

X X W w w X «*^J X \J 


AACATTCCTA 

xvnwn x x v>\« x x* 


1020 


AAGACACAGT 


GACACTGTAC 


ACCAACTCAG 


GCTGCTTGTG 

VW X Vj V*i A A Vj A Vj 


CCCCCAGCTT 

\^V«V«\« W**WW X X 


GTTGCCAATG 

>J A 1 VJWVtf XkJT* X VJ 


1080 

X V w w 


AfifiAATArAT 


AATTATftfiGf 1 


T ATGAAG AC A 

X 1* X WUWJCW* ** 


AAGAGCGTAC 


CAGGCTTCTA 

v«nvj x x w x ** 


CTAGTGGAAG 

v- X i*VJ X V9V3/\/W3 


1140 

X X t V 


GATCCTTGGC 


CGAAAAATGG 


AGAGATCGTC 


TTGCTAAGAA 


AGTCAAGCGC 


TGGGATCAAA 


1200 


AGCTTCGACG 


TCCCAGGAAA 


AGCAAAGACC 


CCGTGGCTCC 


AATTCCCAAC 


AAAAACAGCA 


1260 


ATTCCAGACA 


AGCGCGTAGT 


TAGACTAACG 


GAAAGGTGTA 


TGGAAACTCT 


ATGGACTTTG 


1320 


AAACTAAGAT 


TTGCATTGTT 


GGAAGAGCAA 


AAAAGAAATT 


GCACTACAGC 


ACGTTATATT 


1380 


CTATTGTTTA 


CTACAAGAAG 


CTGGTTTAGT 


TGATTGTAGT 


TCTCCTTTCC 


TTCTTTTTTT 


1440 
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TTATAACTAT ATTTGCACGT GTTCCCAGGC AATTGTTTTA TTCAACTTCC AGTGACAGAG 1500 

CAGTGACTGA ATGTCTCAGC CTAAAGAAGC TCAATTCATT TCTGATCAAC TAATGGTGAC 1560 

AAGTGTTTGA TACTTGGGGA AAGTGAACTA ATTGCAATGG TAAATCAGAG AAAAGTTGAC 1620 

CAATGTTGCT TTTCCTGTAG ATGAACAAGT GAGAGATCAC ATTTAAATGA TGATCACTTT 1680 

CCATTTAATA CTTTCAGCAG TTTTAGTTAG ATGACATGTA GGATGCACCT AAATCTAAAT 1740 

ATTTTATCAT AAATGAAGAG CTGGTTTAGA CTGTATGGTC ACTGTTGGGA AGGTAAATGC 1800 

CTACTTTGTC AATTCTGTTT TAAAAATTGC CTAAATAAAT ATTAAGTCCT AAATAAAAAA 1860 

AAAAAAAAAA AAAAA 1875 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 979 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Leu Leu Leu Phe Arg Ala lie Pro Met Leu Leu Leu Gly Leu Met 
15 10 15 

Val Leu Gin Thr Asp Cys Glu lie Ala Gin Tyr Tyr lie Asp Glu Glu 
20 25 30 

Glu Pro Pro Gly Thr Val lie Ala Val Leu Ser Gin His Ser lie Phe 
35 40 45 

Asn Thr Thr Asp lie Pro Ala Thr Asn Phe Arg Leu Met Lys Gin Phe 
50 55 60 

Asn Asn Ser Leu lie Gly Val Arg Glu Ser Asp Gly Gin Leu Ser lie 
65 70 75 80 

Met Glu Arg lie Asp Arg Glu Gin lie Cys Arg Gin Ser Leu His Cys 

85 90 95 

Asn Leu Ala Leu Asp Val Val Ser Phe Ser Lys Gly His Phe Lys Leu 
100 105 110 
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Leu Asn val 
115 

Phe Pro Ser 
130 

Gly Thr Arg 
145 



Lys Val Glu Val Arg Asp lie Asn Asp His Ser Pro His 

120 125 

Glu He Met His Val Glu Val Ser Glu Ser Ser Ser Val 
135 140 

He Pro Leu Glu lie Ala He Asp Glu Asp Val Gly Ser 

150 155 160 



Asn Ser He Gin Asn Phe Gin He Ser Asn Asn Ser His Phe Ser He 

165 170 175 

Asp Val Leu Thr Arg Ala Asp Gly Val Lys Tyr Ala Asp Leu Val Leu 
180 185 190 



Met Arg Glu 

195 

Leu Ala Met 
210 

Asn He Arg 
225 



Leu Asp Arg Glu He Gin Pro Thr Tyr He Met Glu Leu 

200 205 

Asp Gly Gly Val Pro Ser Leu Ser Gly Thr Ala Val Val 

215 220 

Val Leu Asp Phe Asn Asp Asn Ser Pro Val Phe Glu Arg 
230 235 240 



Ser Thr He Ala Val Asp Leu Val Glu Asp Ala Pro Leu Gly Tyr Leu 

245 250 255 

Leu Leu Glu Leu His Ala Thr Asp Asp Asp Glu Gly Val Asn Gly Glu 

260 265 270 



He Val Tyr 

275 

Phe Lys lie 
290 

Asp Phe Glu 
305 

Leu Gly Pro 
Leu Asp Val 



Thr Val Asn 
355 

Asn Phe He 
370 



Gly Phe Ser Thr Leu Ala Ser Gin Glu Val Arg Gin Leu 

280 285 

Asn Ser Arg Thr Gly Ser Val Thr Leu Glu Gly Gin Val 
295 300 

Thr Lys Gin Thr Tyr Glu Phe Glu Val Gin Ala Gin Asp 
310 315 320 

Asn Pro Leu Thr Ala Thr Cys Lys val Thr val His He 
325 330 335 

Asn Asp Asn Thr Pro Ala He Thr He Thr Pro Leu Thr 
340 345 350 

Ala Gly Val Ala Tyr He Pro Glu Thr Ala Thr Lys Glu 

360 365 

Ala Leu He Ser Thr Thr Asp Arg Ala Ser Gly Ser Asn 
375 380 
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Gly Gin Val Arg Cys Thr Leu Tyr Gly His Glu His Phe Lys Leu Gin 
385 390 395 400 

Gin Ala Tyr Glu Asp Ser Tyr Met He Val Thr Thr Ser Thr Leu Asp 

405 410 415 

Arg Glu Asn He Ala Ala Tyr Ser Leu Thr Val Val Ala Glu Asp Leu 
420 425 430 

Gly Phe Pro Ser Leu Lys Thr Lys Lys Tyr Tyr Thr Val Lys val Ser 
435 440 445 

Asp Glu Asn Asp Asn Ala Pro Val Phe Ser Lys Pro Gin Tyr Glu Ala 
450 455 460 

Ser He Leu Glu Asn Asn Ala Pro Gly Ser Tyr He Thr Thr Val He 
465 470 475 480 

Ala Arg Asp Ser Asp Ser Asp Gin Asn Gly Lys Val Asn Tyr Arg Leu 

485 490 495 

Val Asp Ala Lys Val Met Gly Gin Ser Leu Thr Thr Phe Val Ser Leu 
500 505 510 

Asp Ala Asp Ser Gly Val Leu Arg Ala Val Arg Ser Leu Asp Tyr Glu 
515 520 525 

Lys Leu Lys Gin Leu Asp Phe Glu He Glu Ala Ala Asp Asn Gly He 
530 535 540 

Pro Gin Leu Ser Thr Arg Val Gin Leu Asn Leu Arg He Val Asp Gin 
545 550 555 560 

Asn Asp Asn Cys Pro Val He Thr Asn Pro Leu Leu Asn Asn Gly Ser 

565 570 575 

Gly Glu Val Leu Leu Pro He Ser Ala Pro Gin Asn Tyr Leu Val Phe 
580 585 590 

Gin Leu Lys Ala Glu Asp Ser Asp Glu Gly His Asn Ser Gin Leu Phe 
595 600 605 

Tyr Thr He Leu Arg Asp Pro Ser Arg Leu Phe Ala He Asn Lys Glu 
610 615 620 

Ser Gly Glu Val Phe Leu Lys Lys Gin Leu Asn Ser Asp His Ser Glu 
625 630 635 640 

Asp Leu Ser He Val Val Ala Val Tyr Asp Leu Gly Arg Pro Ser Leu 



645 



650 



655 
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Ser Thr Asn Ala Thr Val Lys Phe lie Leu Thr Asp Ser Phe Pro Ser 
660 665 670 

Asn Val Glu Val Val lie Leu Gin Pro Ser Ala Glu Glu Gin His Gin 
675 680 685 

He Asp Met Ser He He Phe He Ala Val Leu Ala Gly Gly Cys Ala 
690 695 700 

Leu Leu Leu Leu Ala He Phe Phe Val Ala Cys Thr Cys Lys Lys Lys 
705 710 715 720 

Ala Gly Glu Phe Lys Gin Val Pro Glu Gin His Gly Thr Cys Asn Glu 

725 730 735 

Glu Arg Leu Leu Ser Thr Pro Ser Pro Gin Ser Val Ser Ser Ser Leu 
740 745 750 

Ser Gin Ser Glu Ser Cys Gin Leu Ser He Asn Thr Glu Ser Glu Asn 
755 760 765 

Cys Ser Val Ser Ser Asn Gin Glu Gin His Gin Gin Thr Gly He Lys 
770 775 780 

His Ser He Ser Val Pro Ser Tyr His Thr Ser Gly Trp His Leu Asp 
785 790 795 800 

Asn Cys Ala Met Ser He Ser Gly His Ser His Met Gly His He Ser 

805 810 815 

Thr Lys Asp Ser Gly Lys Gly Asp Ser Asp Phe Asn Asp Ser Asp Ser 
820 825 830 

Asp Thr Ser Gly Glu Ser Gin Lys Lys Ser He Glu Gin Pro Met Gin 
835 840 845 

Ala Gin Ala Ser Ala Gin Tyr Thr Asp Glu Ser Ala Gly Phe Arg His 
850 855 860 

Ala Asp Asn Tyr Phe Ser His Arg He Asn Lys Gly Pro Glu Asn Gly 
865 870 875 880 

Asn Cys Thr Leu Gin Tyr Glu Lys Gly Tyr Arg Leu Ser Tyr Ser Val 

885 890 895 

Ala Pro Ala His Tyr Asn Thr Tyr His Ala Arg Met Pro Asn Leu His 
900 905 910 

He Pro Asn His Thr Leu Arg Asp Pro Tyr Tyr His He Asn Asn Pro 
915 920 925 
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Val Ala Asn Arg Met His Ala Glu 
'930 935 

Ser Ala Thr Leu Ser Pro Gin Arg 
945 950 

Asn Tyr Ser Pro Gin lie Ser Arg 

965 

Thr Thr Phe 




Tyr Glu Arg Asp Leu val Asn Arg 
940 

Ser Ser Ser Arg Tyr Gin Glu Phe 
955 960 

Gin Leu His Pro Ser Glu lie Ala 
970 975 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3655 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GAATTCCCAG AGATGAACTC CTTGAGATTG TTTTAAATGA CTGCAGGTCT GGAAGGATTC 60 

ACATTGCCAC ACTGTTTCTA GGCATGAAAA AACTGCAAGT TTCAACTTTG TTTTTGGTGC 120 

AACTTTGATT CTTCAAGATG CTGCTTCTCT TCAGAGCCAT TCCAATGCTG CTGTTGGGAC 180 

TGATGGTTTT ACAAACAGAC TGTGAAATTG CCCAGTACTA CATAGATGAA GAAGAACCCC 240 

CTGGCACTGT AATTGCAGTG TTGTCACAAC ACTCCATATT TAACACTACA GATATACCTG 300 

CAACCAATTT CCGTCTAATG AAGCAATTTA ATAATTCCCT TATCGGAGTC CGTGAGAGTG 360 

ATGGGCAGCT GAGCATCATG GAGAGGATTG ACCGGGAGCA AATCTGCAGG CAGTCCCTTC 420 

ACTGCAACCT GGCTTTGGAT GTGGTCAGCT TTTCCAAAGG ACACTTCAAG CTTCTGAACG 480 

TGAAAGTGGA GGTGAGAGAC ATTAATGACC ATAGCCCTCA CTTTCCCAGT GAAATAATGC 540 

ATGTGGAGGT GTCTGAAAGT TCCTCTGTGG GCACCAGGAT TCCTTTAGAA ATTGCAATAG 600 

ATGAAGATGT TGGGTCCAAC TCCATCCAGA ACTTTCAGAT CTCAAATAAT AGCCACTTCA 660 

GCATTGATGT GCTAACCAGA GCAGATGGGG TGAAATATGC AGATTTAGTC TTAATGAGAG 720 

AACTGGACAG GGAAATCCAG CCAACATACA TAATGGAGCT ACTAGCAATG GATGGGGGTG 780 
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w 












TACCATCACT 


ATCTGGTACT 


GCAGTGGTTA 


ACATCCGAGT 


CCTGGACTTT 


AATGATAACA 


840 


GCCCAGTGTT 


TGAGAGAAGC 


ACCATTGCTG 


TGGACCTAGT 


AGAGGATGCT 


CCTCTGGGAT 


900 


ACCTTTTGTT 


GGAGTTACAT 


GCTACTGACG 


ATGATGAAGG 


AGTGAATGGA 


GAAATTGTTT 


960 


ATGGATTCAG 


CACTTTGGCA 


TCTCAAGAGG 


TACGTCAGCT 


ATTTAAAATT 


AACTCCAGAA 


1020 


CTGGCAGTGT 


TACTCTTGAA 


GGCCAAGTTG 


ATTTTGAGAC 


CAAGCAGACT 


TACGAATTTG 


1080 


AGGTACAAGC 


CCAAGATTTG 


GGCCCCAACC 


CACTGACTGC 


TACTTGTAAA 


GTAACTGTTC 


1140 


ATATACTTGA 


TGTAAATGAT 


AATACCCCAG 


CCATCACTAT 


TACCCCTCTG 


ACTACTGTAA 


1200 


ATGCAGGAGT 


TGCCTATATT 


CCAGAAACAG 


CCACAAAGGA 


GAACTTTATA 


GCTCTGATCA 


1260 


GCACTACTGA 


CAGAGCCTCT 


GGATCTAATG 


GACAAGTTCG 


CTGTACTCTT 


TATGGACATG 


1320 


AGCACTTTAA 


ACTACAGCAA 


GCTTATGAGG 


ACAGTTACAT 


GATAGTTACC 


ACCTCTACTT 


1380 


TAGACAGGGA 


AAACATAGCA 


GCGTACTCTT 


TGACAGTAGT 


TGCAGAAGAC 


CTTGGCTTCC 


1440 


CCTCATTGAA 


GACCAAAAAG 


TACTACACAG 


TCAAGGTTAG 


TGATGAGAAT 


GACAATGCAC 


1500 


CTGTATTTTC 


TAAACCCCAG 


TATGAAGCTT 


CTATTCTGGA 


AAATAATGCT 


CCAGGCTCTT 


1560 


ATATAACTAC 


AGTGATAGCC 


AGAGACTCTG 


ATAGTGATCA 


AAATGGCAAA 


GTAAATTACA 


1620 


GACTTGTGGA 


TGCAAAAGTG 


ATGGGCCAGT 


CACTAACAAC 


ATTTGTTTCT 


CTTGATGCGG 


1680 


ACTCTGGAGT 


ATTGAGAGCT 


GTTAGGTCTT 


TAGACTATGA 


AAAACTTAAA 


CAACTGGATT 


1740 


TTGAAATTGA 


AGCTGCAGAC 


AATGGGATCC 


CTCAACTCTC 


CACTCGCGTT 


CAACTAAATC 


1800 


TCAGAATAGT 


TGATCAAAAT 


GATAATTGCC 


CTGTGATAAC 


TAATCCTCTT 


CTTAATAATG 


1860 


GCTCGGGTGA 


AGTTCTGCTT 


CCCATCAGCG 


CTCCTCAAAA 


CTATTTAGTT 


TTCCAGCTCA 


1920 


AAGCCGAGGA 


TTCAGATGAA 


GGGCACAACT 


CCCAGCTGTT 


CTATACCATA 


CTGAGAGATC 


1980 


CAAGCAGATT 


GTTTGCCATT 


AACAAAGAAA 


GTGGTGAAGT 


GTTCCTGAAA 


AAACAATTAA 


2040 


ACTCTGACCA 


TTCAGAGGAC 


TTGAGCATAG 


TAGTTGCAGT 


GTATGACTTG 


GGAAGACCTT 


2100 


CATTATCCAC 


CAATGCTACA 


GTTAAATTCA 


TCCTCACCGA 


CTCTTTTCCT 


TCTAACGTTG 


2160 


AAGTCGTTAT 


TTTGCAACCA 


TCTGCAGAAG 


AGCAGCACCA 


GATCGATATG 


TCCATTATAT 


2220 


TCATTGCAGT 


GCTGGCTGGT 


GGTTGTGCTT 


TGCTACTTTT 


GGCCATCTTT 


TTTGTGGCCT 


2280 


GTACTTGTAA 


AAAGAAAGCT 


GGTGAATTTA 


AGCAGGTACC 


TGAACAACAT 


GGAACATGCA 


2340 
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ATGAAGAACG 


CCTGTTAAGC 


ACCCCATCTC 


CCCAGTCGGT 


CTCTTCTTCT 


TTGTCTCAGT 


2400 


CTGAGTCATG 


CCAACTCTCC 


ATCAATACTG 


AATCTGAGAA 


TTGCAGCGTG 


TCCTCTAACC 


2460 


AAGAGCAGCA 


TCAGCAAACA 


GGCATAAAGC 


ACTCCATCTC 


TGTACCATCT 


TATCACACAT 


2520 


CTGGTTGGCA 


CCTGGACAAT 


TGTGCAATGA 


GCATAAGTGG 


ACATTCTCAC 


ATGGGGCACA 


2580 


TTAGTACAAA 


GGACAGTGGC 


AAAGGAGATA 


GTGACTTCAA 


TGACAGTGAC 


TCTGATACTA 


2640 


GTGGAGAATC 


ACAAAAGAAG 


AGCATTGAGC 


AGCCAATGCA 


GGCACAAGCC 


AGTGCTCAAT 


2700 


ACACAGATGA 


ATCAGCAGGG 


TTCCGACATG 


CCGATAACTA 


TTTCAGCCAC 


CGAATCAACA 


2760 


AGGGTCCAGA 


AAATGGGAAC 


TGCACATTGC 


AATATGAAAA 


GGGCTATAGA 


CTGTCTTACT 


2820 


CTGTAGCTCC 


TGCTCATTAC 


AATACCTACC 


ATGCAAGAAT 


GCCTAACCTG 


CACATACCGA 


2880 


ACCATACCCT 


TAGAGACCCT 


TATTACCATA 


TCAATAATCC 


TGTTGCTAAT 


CGGATGCACG 


2940 


CGGAATATGA 


AAGAGATTTA 


GTCAACAGAA 


GTGCAACGTT 


ATCTCCGCAG 


AGATCGTCTA 


3000 


GCAGATACCA 


AGAATTCAAT 


TACAGTCCGC 


AGATATCAAG 


ACAGCTTCAT 


CCTTCAGAAA 


3060 


TTGCTACAAC 


CTTTTAATCA 


TTAGGCATGC 


AAGTGAGAAT 


GCACAAAGGC 


AAGTGCTTTA 


3120 


GCATGAAAGC 


TAAATATATG 


GAGTCTCCCC 


TTTCCCTCTG 


ATGGATGGGG 


GGAGACACAG 


3180 


GACAGTGCAT 


AAATATACAG 


CTGCTTTCTA 


TTTGCATTTC 


ACTTGGGAAT 


TTTTTGTTTT 


3240 


TTTTACATAT 


TTATTTTTCC 


TGAATTGAAT 


GTGACATTGT 


CCTGTCACCT 


AACTAGCAAT 


3300 


TAAATCCACA 


GACCTACAGT 


CAAATATTTG 


AGGGCCCCTG 


AAACAGCACA 


TCAGTCAGGA 


3360 


CCTAAAGTGG 


CCTTTTTACT 


TTTAGCAGCT 


CCTGGGTCTG 


CCCTCTGTGT 


TAATCAGCCC 


3420 


LTlivjI LAAbl 


1 VjAlj 1 Ativj 




111 lAlAlbL 


Ai^l l_/\t_v_ 1 A 


till oVjrllrVj 1 


J n 0 v 


GATTTACACA 


TAATAGGAAA 


CGCTTGGTTT 


CAGTGAAGTC 


TGTGTTGTAT 


ATATTCTGTT 


3540 


ATATACACGC 


ATTTTGTGTT 


TGTGTATATA 


TTTCAAGTCC 


ATTCAGATAT 


GTGTATATAG 


3600 


TGCAGACCTT 


GTAAATTAAA 


TATTCTGATA 


CTTTTTCCTC 


AATAAATATT 


TAAAT 


3655 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(Xi\ SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Val Cys Cys Gly Pro Gly Arg Met Leu Leu Gly Trp Ala Gly Leu 
15 10 15 

Leu Val Leu Ala Ala Leu Cys Leu Leu Gin Val Pro Gly Ala Gin Ala 
20 25 30 

Ala Ala Cys &lu Pro val Arg lie Pro Leu Cys Lys Ser Leu Pro Trp 
35 40 45 

Asn Met Thr Lys Met Pro Asn His Leu His His Ser Thr Gin Ala Asn 
50 55 60 

Ala lie Leu Ala Met Glu Gin Phe Glu Gly Leu Leu Gly Thr His Cys 
65 70 75 80 

Ser Pro Asp Leu Leu \Phe Phe Leu Cys Ala Met Tyr Ala Pro lie Cys 

85 \ 90 95 

Thr lie Asp Phe Gin His Glu Pro lie Lys Pro Cys Lys Ser Val Cys 
100 105 110 

Glu Arg Ala Arg Gin Gly Cys Glu Pro lie Leu lie Lys Tyr Arg His 
115 120 125 

Ser Trp Pro Glu Ser Leu Ala Cys Asp Glu Leu Pro Val Tyr Asp Arg 
130 135 \ 140 

Gly Val Cys lie Ser Pro Glu A^a lie Val Thr Ala Asp Gly Ala Asp 
145 150 155 160 

Phe Pro Met Asp Ser Ser Thr Gly His Cys Arg Gly Ala Ser Ser Glu 

165 170 175 

Arg Cys Lys Cys Lys Pro Val Arg Ala Thr Gin Lys Thr Tyr Phe Arg 
180 185 190 

Asn Asn Tyr Asn Tyr Val lie Arg Ala Lys Val Lys Glu Val Lys Met 
195 200 205 

Lys Cys His Asp Val Thr Ala Val Val Glu Val Lys Glu lie Leu Lys 
210 215 220 

Ala Ser Leu Val Asn lie Pro Arg Asp Thr Val Asn Leu Tyr Thr Thr 
225 230 235 240 
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Ser Gly Cys Leu Cys Pro Pro Leu 

245 

Met Gly Tyr Glu Asp Glu Glu Arg 
260 

Ser lie Ala Glu Lys Trp Lys Asp 
275 280 

Trp Asp Met Lys Leu Arg His Leu 
290 295 

Asp Ser Thr Gin Asn Gin Lys Ser 
305 310 

Ala Arg Ser 




Thr Val Asn Glu Glu Tyr Val lie 
250 255 

Ser Arg Leu Leu Leu Val Glu Gly 
265 270 

Arg Leu Gly Lys Lys Val Lys Arg 

285 

Gly Leu Gly Lys Thr Asp Ala Ser 
300 

Gly Arg Asn Ser Asn Pro Arg Pro 
315 320 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2176 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

AAGCCTGGGA CCATGGTCTG CTGCGGCCCG GGACGGATGC TGCTAGGATG GGCCGGGTTG 60 

CTAGTCCTGG CTGCTCTCTG CCTGCTCCAG GTGCCCGGAG CTCAGGCTGC AGCCTGTGAG 120 

CCTGTCCGCA TCCCGCTGTG CAAGTCCCTT CCCTGGAACA TGACCAAGAT GCCCAACCAC 180 

CTGCACCACA GCACCCAGGC TAACGCCATC CTGGCCATGG AACAGTTCGA AGGGCTGCTG 240 

GGCACCCACT GCAGCCCGGA TCTTCTCTTC TTCCTCTGTG CAATGTACGC ACCCATTTGC 300 

ACCATCGACT TCCAGCACGA GCCCATCAAG CCCTGCAAGT CTGTGTGTGA GCGCGCCCGA 360 

CAGGGCTGCG AGCCCATTCT CATCAAGTAC CGCCACTCGT GGCCGGAAAG CTTGGCCTGC 420 

GACGAGCTGC CGGTGTACGA CCGCGGCGTG TGCATCTCTC CTGAGGCCAT CGTCACCGCG 4 80 

GACGGAGCGG ATTTTCCTAT GGATTCAAGT ACTGGACACT GCAGAGGGGC AAGCAGCGAA 540 

CGTTGCAAAT GTAAGCCTGT CAGAGCTACA CAGAAGACCT ATTTCCGGAA CAATTACAAC 600 
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W 






w 






TATGTCATCC 


GGGCTAAAGT 


TAAAGAGGTA 


AAGATGAAAT 


GTCATGATGT 


GACCGCCGTT 


660 


GTGGAAGTGA 


AGGAAATTCT 


AAAGGCATCA 


CTGGTAAACA 


TTCCAAGGGA 


CACCGTCAAT 


720 


CTTTATACCA 


CCTCTGGCTG 


CCTCTGTCCT 


CCACTTACTG 


TCAATGAGGA 


ATATGTCATC 


780 


ATGGGCTATG 


AAGACGAGGA 


ACGTTCCAGG 


TTACTCTTGG 


TAGAAGGCTC 


TATAGCTGAG 


840 


AAGTGGAAGG 


ATCGGCTTGG 


TAAGAAAGTC 


AAGCGCTGGG 


ATATGAAACT 


CCGACACCTT 


900 


GGACTGGGTA 


AAACTGATGC 


TAGCGATTCC 


ACTCAGAATC 


AGAAGTCTGG 


CAGGAACTCT 


960 


AATCCCCGGC 


CAGCACGCAG 


CTAAATCCTG 


AAATGTAAAA 


GGCCACACCC 


ACGGACTCCC 


1020 


TTCTAAGACT 


GGCGCTGGTG 


GACTAACAAA 


GGAAAACCGC 


ACAGTTGTGC 


TCGTGACCGA 


1080 


TTGTTTACCG 


CAGACACCGC 


GTGGCTACCG 


AAGTTACTTC 


CGGTCCCCTT 


TCTCCTGCTT 


1140 


CTTAATGGCG 


TGGGGTTAGA 


TCCTTTAATA 


TGTTATATAT 


TCTGTTTCAT 


CAATCACGTG 


1200 


GGGACTGTTC 


TTTTGCAACC 


AGAATAGTAA 


ATTAAATATG 


TTGATGCTAA 


GGTTTCTGTA 


1260 


CTGGACTCCC 


TGGGTTTAAT 


TTGGTGTTCT 


GTACCCTGAT 


TGAGAATGCA 


ATGTTTCATG 


1320 


TAAAGAGAGA 


ATCCTGGTCA 


TATCTCAAGA 


ACTAGATATT 


GCTGTAAGAC 


AGCCTCTGCT 


1380 


GCTGCGCTTA 


TAGTCTTGTG 


TTTGTATGCC 


TTTGTCCATT 


TCCCTCATGC 


TGTGAAAGTT 


1440 


ATACATGTTT 


ATAAAGGTAG 


AACGGCATTT 


TGAAATCAGA 


CACTGCACAA 


GCAGAGTAGC 


1500 


CCAACACCAG 


GAAGCATTTA 


TGAGGAAACG 


CCACACAGCA 


TGACTTATTT 


TCAAGATTGG 


1560 


CAGGCAGCAA 


AATAAATAGT 


GTTGGGAGCC 


AAGAAAAGAA 


TATTTTGCCT 


GGTTAAGGGG 


1620 


CACACTGGAA 


TCAGTAGCCC 


TTGAGCCATT 


AACAGCAGTG 


TTCTTCTGGC 


AAGTTTTTGA 


1680 


TTTGTTCATA 


AATGTATTCA 


CGAGCATTAG 


AGATGAACTT 


ATAACTAGAC 


ATCTGTTGTT 


1740 


ATCTCTATAG 


CTCTGCTTCC 


TTCTAAATCA 


AACCCATTGT 


TGGATGCTCC 


CTCTCCATTC 


1800 


ATAAATAAAT 


TTGGCTTGCT 


GTATTGGCCA 


GGAAAAGAAA 


GTATTAAAGT 


ATGCATGCAT 


1860 


GTGCACCAGG 


GTGTTATTTA 


ACAGAGGTAT 


GTAACTCTAT 


AAAAGACTAT 


AATTTACAGG 


1920 


ACACGGAAAT 


GTGCACATTT 


GTTTACTTTT 


TTTCTTCCTT 


TTGCTTTGGG 


CTTGTGATTT 


1980 


TGGTTTTTGG 

A AAA A A 


TGTGTTTATG 


TCTGTATTTT 


GGGGGGTGGG 


TAGGTTTAAG 


CCATTGCACA 


2040 


TTCAAGTTGA 


ACTAGATTAG 


AGTAGACTAG 


GCTCATTGGC 


CTAGACATTA 


TGATTTGAAT 


2100 
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TTGTGTTGTT TAATGCTCCA TCAAGATGTC TAATAAAAGG AATATGGTTG TCAACAGAGA 2160 
CGACAACAAC AACAAA 2176 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

Met Val Cys Gly Ser Pro Gly Gly Met Leu Leu Leu Arg Ala Gly Leu 
15 10 15 

Leu Ala Leu Ala Ala Leu Cys Leu Leu Arg Val Pro Gly Ala Arg Ala 
20 25 30 

Ala Ala Cys Glu Pro val Arg He Pro Leu Cys Lys Ser Leu Pro Trp 
35 40 45 

Asn Met Thr Lys Met Pro Asn His Leu His His Ser Thr Gin Ala Asn 
50 55 60 

Ala He Leu Ala He Glu Gin Phe Glu Gly Leu Leu Gly Thr His Cys 
65 70 75 80 

Ser Pro Asp Leu Leu Phe Phe Leu Cys Ala Met Tyr Ala Pro He Cys 

85 90 95 

Thr He Asp Phe Gin His Glu Pro He Lys Pro Cys Lys Ser Val Cys 
100 105 110 

Glu Arg Ala Arg Gin Gly Cys Glu Pro He Leu He Lys Tyr Arg His 
115 120 125 

Ser Trp Pro Glu Asn Leu Ala Cys Glu Glu Leu Pro Val Tyr Asp Arg 
130 135 140 

Glv Val Cvs He Ser Pro Glu Ala He Val Thr Ala Asp Gly Ala Asp 
145 150 155 160 

Phe Pro Met Asp Ser Ser Asn Gly Asn Cys Arg Gly Ala Ser Ser Glu 

165 170 175 
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Arg Cys Lys 



Asn Asn Tyr 
195 

Lys Cys His 
210 

Ser Ser Leu 
225 

Ser Gly Cys 



Met Gly Tyr 



Ser lie Ala 
275 

Trp Asp Met 
290 

Asn Ser Asp 
305 

Arg Gin Ala 



Cys Lys 
180 

Asn Tyr 

Asp Val 

Val Asn 

Leu Cys 
245 

Glu Asp 
260 

Glu Lys 
Lys Leu 
Ser Thr 



Pro lie 

Val He 

Thr Ala 
215 

He Pro 
230 

Pro Pro 
Glu Glu 
Trp Lys 



Arg His 
2$ 5 

Gin Se£ 
310 



Arg Ala Thr 
185 

Arg Ala Lys 
200 

val Val Glu 

Arg Asp Thr 

Leu Asn Val 
250 

Arg Ser Arg 
265 

Asp Arg Leu 
280 

Leu Gly Leu 
Gin Lys Ser 



Gin Lys Thr Tyr Phe Arg 
190 

Val Lys Glu He Lys Thr 
205 

Val Lys Glu He Leu Lys 
220 

Val Asn Leu Tyr Thr Ser 
235 240 

Asn Glu Glu Tyr He He 

255 

Leu Leu Leu Val Glu Gly 
270 

Gly Lys Lys Val Lys Arg 
285 

Ser Asp Ser Ser 



Ser Lys 
300 

Gly Arg 
315 



Asn Ser Asn Pro 
320 



Arg Asn 
325 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1893 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGCGGAGCGG GCCTTTTGGC GTCCACTGCG CGGCTGCACC CTGCCCCATC TGCCGGGATC 60 

ATGGTCTGCG GCAGCCCGGG AGGGATGCTG CTGCTGCGGG CCGGGCTGCT TGCCCTGGCT 120 

GCTCTCTGCC TGCTCCGGGT GCCCGGGGCT CGGGCTGCAG CCTGTGAGCC CGTCCGCATC 180 

CCCCTGTGCA AGTCCCTGCC CTGGAACATG ACTAAGATGC CCAACCACCT GCACCACAGC 24 0 
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ACTCAGGCCA 


ACGCCATCCT 


GGCCATCGAG 

XVXVX^X^A A A X^ I X^ 


CAGTTCGAAG 

X^ A* XV A A V* XJ • M A X^ 


GTCTGCTGGG 


CACCCACTGC 


300 

•J V V 


AGCCCCGATC 

• AW X* X* X* XV X*A * A X^ 


TGCTCTTCTT 

XV » X* A A X* A A 


CCTCTGTGCC 

X^ X^ A X* A X« A Ww X* 


ATGTACGCGC 

• » A XJ A X* x« x^ x^ x^ 


CCATCTGCAC 

x# X^ • • • X^ • X^ X* * * x^ 


CATTGACTTC 

X^ • • A A X* m m A A X^ 


360 


CAGCACGAGC 


CCATCAAGCC 

x^ • • * ** *x* x^ x^ 


CTGTAAGTCT 

X^ A XV * * ** • Xrf A X* A 


GTGTGCGAGC 

X^ m\ W A XJ X* X* * * X^ X^ 


GGGCCCGGCA 

X^ X# X* X* x^ x^ x^ x^ x^ • * 


GGGCTGTGAG 


420 


CCCATACTCA 

X^X^ X^A^A A A *X* A X^* A 


tcaAgtaccg 

a X^A li A, Xrf A 4 AX^ X* XV 


CCACTCGTGG 

V^ V*> * A V* VJ A VJ U 


CCGGAGAACC 


TGGCCTGCGA 


GGAGCTGCCA 

XV XV *m XV X^ A XV X^ X^ * m 


480 


GTGTACGACA 

VJ A VJ A **V* VJ*1 IV* * 


ggggcgtgtg 

XVXVX*XVX^^f A XV A XV 


CATCTCTCCC 

V^** A V«» A V* A w V* V* 


GAGGCCATCG 


TTACTGCGGA 


CGGAGCTGAT 

X^ XV XV mm XV X^ A XV AAA 


540 


TTTCCTATGG 

AAA V^ V^ A * * A VJ 


attcta&taa 


CGGAAACTGT 

V^ VJ VJAUIAIVm* A VJ A 


AGAGGGGCAA 


GCAGTGAACG 

VJ V* *lw A VJ 


CTGTAAATGT 


600 


AAGCCTATTA 


GAGCTACACA 

x^* *x^ x* a • ax^ataxaaa 


GAAGACCTAT 

VJ*^**VJ**V# V* A *» A 


TTCCGGAACA 


ATTACAACTA 

mm A A iftWiu* v m± m m 


TGTCATTCGG 

A XV A X^ AAA A X* X# X* 


660 


GCTAAAGTTA 


AAG AG AT AAA 


GACTAAGTGC 

x^* as^ A * ** *xv * X* X* 


CATGATGTGA 


CTGCAGTAGT 


GGAGGTGAAG 


720 


GAGATTCTAA 


AGTCCTCTCT 


GGTAAACATT 


CCACGGGACA 


CTGTCAACCT 


CTATACCAGC 


780 


TCTGGCTGCC 


TCTGCCCTCC 


ACTTAATGTT 


AATGAGGAAT 


ATATCATCAT 


GGGCTATGAA 


840 


GATGAGGAAC 


GTTCCAGATT 


ACTCTTGGTG 


GAAGGCTCTA 


TAGCTGAGAA 


GTGGAAGGAT 


900 


CGACTCGGTA 

V^VJ* *w A V* VJVJ A *» 


AAAAAGTTAA 


GCGCTGGGAT 


ATGAAGCTTC 


GTCATCTTGG 

X^ A x^ * * * X^ * * X# XV 


ACTCAGTAAA 

A A X^ A XV A *W A A 1* AA A 


960 


AGTGATTCTA 

A^ AXV A XVJ * A * XV A * * 


GCAATAGTGA 

XVX^A ** A A * iW A X*A A 


TTCCACTCAG 

A A X^X^A * w A XW AXV 


AGTCAGAAGT 

mm X^ m> X^ * * X4 • m>& m X^ A 


CTGGCAGGAA 


CTCGAACCCC 


1020 


CGGCAAGCAC 

x^xvxvx^a AA A XV X*A 


GCAACTAAAT 

XVX^A AA AX^ A * AA AA & A 


CCCGAAATAC 

X^ X^ X^ XV A AA AA A A A A X^ 


AAAAAGTAAC 

* A* m* A. X^ A * * X^ 


ACAGTGGACT 


TCCTATTAAG 


1080 


ACTTACTTGC 

* AX^ A A AAX* * A XV Vrf 


ATTGCTGGAC 

A* A * \J X^ * X* w<4w 


TAGCAAAGGA 

A AiW ViUU^W XV mm 


AAATTGCACT 

A * A A X4 • * X^ m\ 


ATTGCACATC 

m m m± im> X* X* * m x^ * * * X* 


ATATTCTATT 

A A A A A A A X^ A A A A A 


1140 


GTTTACTATA 


AAAATCATGT 


GATAACTGAT 


TATTACTTCT 


GTTTCTCTTT 


TGGTTTCTGC 


1200 


TTCTCTCTTC 


TCTCAACCCC 


TTTGTAATGG 


TTTGGGGGCA 


GACTCTTAAG 


TATATTGTGA 


1260 


GTTTTCTATT 


TCACTAATCA 


TGAGAAAAAC 


TGTTCTTTTG 


CAATAATAAT 


AAATTAAACA 


1320 


TGCTGTTACC 

A xv X^ A XV A * aax^ x^ 


AGAGCCTCTT 


TGCTGAGTCT 

A VJ V-» A WtlV A V* A 


CCAGATGTTA 

X^ X^ XJ A> W A * mm 


ATTTACTTTC 

A* A * *X^ AAA x^ 


TGCACCCCAA 


1380 


TTGGGAATGC 

A A VjVJVJ*»*» A WV 


AATATTGGAT 

• A *» A A VJVJ** A 


GAAAAGAGAG 

VJ4&A*J&J* VJ&l VJ** VJ 


GTTTCTGGTA 

AAA A A A M. 


TTCACAGAAA 


GCTAGATATG 

VJ V* A A * VJ 6 » A A VJ 


1440 


CCTTAAAACA 


TACTCTGCCG 

A «»V* A Vv A VV W VJ 


ATCTAATTAC 

• *AV*A*V»A A<*V* 


AGCCTTATTT 

V* A A •* AAA 


TTGTATGCCT 

A A X# A *• A XV X^ X^ A 


TTTGGGCATT 

AAA VJ VJ VJ V*«* A A 


1500 


CTCCTCATGC 

X* A XV X* A X^A A ^ xvx^ 


TTAGAAAGTT 

A A •*VJ***»*» VJ A A 


CCAAATGTTT 

X^ X^A Uai * * XV 


ATAAAGGTAA 


AATGGCAGTT 


TGAAGTCAAA 

A XV A AA A X# A X* A AV AV A 


1560 


TGTCACATAG 


GCAAAGCAAT 


CAAGCACCAG 


GAAGTGTTTA 


TGAGGAAACA 


ACACCCAAGA 


1620 


TGAATTATTT 


TTGAGACTGT 


CAGGAAGTAA 


AATAAATAGG 


AGCTTAAGAA 


AGAACATTTT 


1680 


GCCTGATTGA 


GAAGCACAAC 


TGAAACCAGT 


AGCCGCTGGG 


GTGTTAATGG 


TAGCATTCTT ■ 


1740 


CTTTTGGCAA 


TACATTTGAT 


TTGTTCATGA 


ATATATTAAT 


CAGCATTAGA 


GAAATGAATT 


1800 
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• 



ATAACTAGAC ATCTGCTGTT ATCACCATAG TTTTGTTTAA TTTGCTTCCT TTTAAATAAA I860 
CCCATTGGTG AAAGTCAAAA AAAAAAAAAA AAA 1893 
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